Sampling distribution of the SAM statistic
نویسنده
چکیده
1 Background We derive the sampling distribution of the SAM (significance analysis for microarrays) statistic introduced by Tusher et al. (2001). The statistic resembles the t-statistic, but has an additional regularizing term in the denominator, which is a constant for a given dataset. We first introduce some notation and the SAM statistic, and then proceed to derive an approximate sampling distribution. Notation Consider log-transformed microarray data D in two classes: X i and Y j are taken to be i.i.d. random variables drawn according to Normal distributions N (µ X , σ 2 X) and N (µ Y , σ 2 Y) respectively. That is, µ X and µ Y are the true (or population) class means and σ 2 X and σ 2 Y the corresponding variances. Let ¯ X and ¯ Y represent the sample means of data in the two classes: ¯ X = 1 m m i=1
منابع مشابه
On the Use of Permutation in and the Performance of A Class of Nonparametric Methods to Detect Differential Gene Expression
MOTIVATION Recently a class of nonparametric statistical methods, including the empirical Bayes (EB) method, the significance analysis of microarray (SAM) method and the mixture model method (MMM), have been proposed to detect differential gene expression for replicated microarray experiments conducted under two conditions. All the methods depend on constructing a test statistic Z and a so-call...
متن کاملApproximating the Sampling Distribution of the T2 Statistic
Introduction The sampling distribution of a T statistic is well established when the statistic is based on observations taken from a multivariate normal population (MVN). With known parameters, the sampling distribution is always a chi-square distribution. In a Phase I operation with unknown parameters, where outliers need to be detected, the distribution of the T statistic is a beta distributi...
متن کاملMaking Inferences about Parameters
One must know the sampling distribution of the estimator (the statistic used to estimate I shall use to stand for the statistic used to estimate ) to make full use of the estimator. The sampling distribution of a statistic is the distribution that would be obtained if you repeatedly drew samples of a specified size from a specified population and computed on each sample. In other wo...
متن کاملA fast algorithm for computing the sampling distribution of a statistic from discrete populations
In this work we propose a fast algorithm for computing the exact small sampling distribution of a given statistic, when the population random variable is discrete. The algorithm relies on a recursion on block matrices that describes all possible random samples that can be generated. In this way, the power of modern programming which deÞnes objects in term of matrices is fully exploited for effe...
متن کاملDetecting Outliers in Exponentiated Pareto Distribution
In this paper, we use two statistics for detecting outliers in exponentiated Paretodistribution. These statistics are the extension of the statistics for detecting outliers inexponential and gamma distributions. In fact, we compare the power of our test statisticsbased on the simulation study and identify the better test statistic for detecting outliers inexponentiated Pareto distribution. At t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004